A More Powerful Two-Sample Test in High Dimensions using Random Projection
Abstract
We consider the hypothesis testing problem of detecting a shift between the means of two multivariate normal distributions in the high-dimensional setting, allowing for the data dimension p to exceed the sample size n. Our contribution is a new test statistic for the two-sample test of means that integrates a random projection with the classical Hotelling T² statistic. Working within a high-dimensional framework that allows (p, n) → ∞, we first derive an asymptotic power function for our test, and then provide sufficient conditions for it to achieve greater power than other state-of-the-art tests. Using ROC curves generated from simulated data, we demonstrate superior performance against competing tests in the parameter regimes anticipated by our theoretical results. Lastly, we illustrate an advantage of our procedure with comparisons on a high-dimensional gene expression dataset involving the discrimination of different types of cancer.
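The general idea of combining a random projection with Hotelling's T² can be sketched as follows. This is only an illustrative sketch, not necessarily the exact statistic defined in the paper: the function name `projected_hotelling_t2`, the choice of an i.i.d. Gaussian projection matrix, and the projected dimension `k` are all assumptions made here for illustration.

```python
import numpy as np


def projected_hotelling_t2(X, Y, k, rng=None):
    """Hotelling T^2 computed after projecting both samples to k dimensions.

    X: array of shape (n1, p), Y: array of shape (n2, p).
    k: projected dimension; it should satisfy 1 <= k <= n1 + n2 - 2 so that
       the pooled covariance of the projected data is invertible.
    rng: seed or numpy Generator (hypothetical convenience parameter).
    """
    rng = np.random.default_rng(rng)
    n1, p = X.shape
    n2, _ = Y.shape

    # Assumption: projection matrix with i.i.d. standard normal entries.
    P = rng.standard_normal((p, k))
    Xk, Yk = X @ P, Y @ P

    # Difference of the projected sample means.
    diff = Xk.mean(axis=0) - Yk.mean(axis=0)

    # Pooled sample covariance in the projected k-dimensional space.
    Sx = np.atleast_2d(np.cov(Xk, rowvar=False))
    Sy = np.atleast_2d(np.cov(Yk, rowvar=False))
    S = ((n1 - 1) * Sx + (n2 - 1) * Sy) / (n1 + n2 - 2)

    # Classical two-sample Hotelling T^2 on the projected data.
    t2 = (n1 * n2) / (n1 + n2) * diff @ np.linalg.solve(S, diff)
    return float(t2)
```

Because the projected data live in only k dimensions, the pooled covariance is invertible even when p exceeds n1 + n2. For normal data with a common covariance, and conditional on the projection matrix, the classical result applies to the projected samples: under the null hypothesis, (n1 + n2 - k - 1) / (k (n1 + n2 - 2)) · T² follows an F distribution with k and n1 + n2 - k - 1 degrees of freedom.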
Related resources
RAPTT: An Exact Two-Sample Test in High Dimensions Using Random Projections
In high dimensions, the classical Hotelling’s T² test tends to have low power or becomes undefined due to singularity of the sample covariance matrix. In this paper, this problem is overcome by projecting the data matrix onto lower dimensional subspaces through multiplication by random matrices. We propose RAPTT (RAndom Projection T-Test), an exact test for equality of means of two normal pop...
Evaluating E-Learning Maturity from the Viewpoints of Medical Sciences Students
Introduction: The digitalization of education is considered a major reform in higher education. E-learning programs are increasingly seen as a way to reform medical sciences education, giving access to ongoing learning and training without time or geographical barriers. Technology is a powerful tool for effective teaching and deep learning. Therefore, the aim of this paper is evalua...
Small Sample Size in High Dimensional Space - Minimum Distance Based Classification
In this paper we present some new results concerning classification in the small-sample, high-dimensional case. We discuss geometric properties of data structures in high dimensions. It is known that such data form an almost regular simplex in high dimensions even if the covariance structure of the data is not unity. We restrict our attention to two-class discrimination problems. It is assumed that ob...
Predicting High School Students' Test Anxiety Based on Their Perfectionism Dimensions
Abstract: The present study aimed to predict high school students' test anxiety based on perfectionism dimensions. The population of the study included junior high school students of humanities, science, and mathematics in Tabriz. The sample consisted of 168 people who were selected by cluster random sampling. The Spielberger Anxiety Test and the Multidimensional Perfectionism...
The Relationship between Metaphors and Eysenck's Introversion/Extraversion Personality Dimensions
The goal of the present research was to determine the relationship between Eysenck's "E" personality dimensions and a selection of metaphorical concepts. Past research has emphasized personality and linguistic components in literal language. The present research investigated metaphor as a part of figurative language and its relation to the two personality dimensions. The initial sa...
Publication date: 2011